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We investigate the dynamics of DNA translocation through a nanopore using 2D Langevin dy- 
namics simulations, focusing on the dependence of the translocation dynamics on the details of 
DNA sequences. The DNA molecules studied in this work are built from two types of bases A and 
C, which has been shown previously to have different interactions with the pore. We study DNA 
with repeating blocks AnCn for various values of n, and find that the translocation time depends 
strongly on the block length 2n as well as on the orientation of which base entering the pore first. 
Thus, we demonstrate that the measurement of translocation dynamics of DNA through nanopore 
can yield detailed information about its structure. We have also found that the periodicity of the 
block sequences are contained in the periodicity of the residence time of the individual nucleotides 
inside the pore. 

PACS numbers: 87.15.Aa, 87.15.He 
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The translocation of biopolymers across membranes is 
ubiquitous in biological systems, such as DNA and RNA 
translocation across nuclear pores, protein transport 
through membrane channels, and virus injection into 
cells. In a seminal experimental paper, Kasianowicz et 
al. [l| demonstrated that an electric field can drive single- 
stranded DNA and RNA molecules through the water- 
filled a-hemolysin channel and that the passage of each 
molecule is signaled by a blockade in the channel cur- 
rent, whose magnitude and duration depend on the struc- 
ture of the DNA or RNA molecule. Similar experiments 
have been done recently using solid state nanopores 
with more precisely controlled dimensions. Triggered by 
these experiments and potential technological applica- 
tions [l|, y , such as rapid DNA sequencing, gene ther- 
apy and controlled drug delivery, the translocation of 
biopolymers through nanopore has become a subject of 
intensive experimental 3, 4, 5 , Im LZ , [SJ and theoretical 
i, 9, 10, 11, 12, 13, 14, 15, 16, iJSSHim studies. 
A particular important question is if DNA translocation 
through a nanopore can be used to determine the detailed 
sequence structure of the molecule [2|, [SJ, |j, ll8| . 

It has been demonstrated in experiments [J, |5[ that 
translocation through a-hemolysin pore can be used to 
discriminate between polydeoxyadenylic acid (poly(dA)) 
and polydcoxycytidylic acid (poly(dC)) molecules of the 
same chain length. The translocation time of poly(dA) 
is found to be longer, and its distribution is wider with 
a longer tail compared with the corresponding data for 
poly(dC). The different behavior was attributed to dif- 
ferent interactions of the nucleotides with the pore, with 
the base A having a stronger attractive interaction with 
the pore than the base C. These experimental findings 
and conclusions were quantitatively supported by recent 
Langevin dynamics (LD) simulations [l9| with a model 



for the DNA molecules incorporating different base-pore 
interactions. 

Inspired by the ability to discriminate poly (d A) and 
poly(dC) with the same chain length, Meller et al. [J, 
|5| have studied different behavior of the transloca- 
tion time distribution for the heteroDNA molecules 
poly(dA5odC,5o) and poly(dAdC)5o. The result suggests 
that translocation through a nanopore can distinguish 
between DNA polynucleotides of similar length with com- 
positions that differ only in detailed sequence structure. 
In this paper we seek to shed light on this important ques- 
tion. We adopt a model for the hetero-DNA molecules 
with different base-pore interactions and investigate the 
sequence dependence of their translocation dynamics us- 
ing LD simulations. 

In our model, a single stranded DNA molecule is rep- 
resented as a bead-spring chain. A short range repul- 
sive Lennard- Jones (LJ) potential Ul.j{t) = 4e[(^) — 
{^f] + £ for r < 2i/V and for r > 2^/^a exists be- 
tween all beads leading to excluded volume interaction. 
Here, a is the diameter of a bead, and e is the depth 
of the potential. Neighboring beads are connected by a 
Finite Extension Nonlinear Elastic (FENE) spring with 
interaction energy UpENEir) = — ^A:i?Qln(l — t^/Rq), 
where r is the distance between consecutive monomers, 
k is the spring constant and Rq is the maximum allowed 
separation between connected monomers. We consider 
a 2D geometry as shown in Fig. 1, where the wall of 
thickness L is formed by columns of stationary particles. 
A pore of length L and width W in the center of the 
wall connects the cis and the trans sides and a voltage 
is applied across the pore to drive the negatively charged 
DNA through the pore. Between all bead-wall particle 
pairs, there exist the same short range repulsive LJ inter- 
action as described above. The pore-bead interaction is 



modeled by a LJ potential with a cutoff of 2.5(t and inter- 
action strength SpA for the base A and Epc for the base 
C. Each bead is subjected to conservative, frictional, 
and random forces, respectively, leading to the equation 
of motion mVi = -V{Ulj + Ufene) + Text - S.^i + Ff , 
where m is the bead's mass, ^ is the friction coefficient, v.^ 
is the bead's velocity, and Ff is the random force which 
satisfies the fluctuation-dissipation theorem 221 . The ex- 
ternal force due to the applied voltage is represented by 
Fcxt = Fx. 

The LJ parameters e, a and the bead mass fix the sys- 
tem energy, length and mass units respectively, leading 
to the corresponding time scale t^j = {raa"^ jeYl'^ and 
force scale eju. In our model, each bead corresponds 
to a Kuhn length of a single-stranded DNA containing 
approximately three nucleotide bases, so the value of 
(T '^ 1.5 nm [23|. The average mass of a base in DNA 
is about 312 amu, so the bead mass m « 936 amu. We 
set fcsT = 1.2e, which means the interaction strength 
e to be 3.39 x 10^^^ J at actual temperature 295 K. 
This leads to a time scale of 32.1 ps and a force scale 
of 2.3 pN. The dimensionless parameters in the model 
are then chosen to be i?o = 2, fc = 7, ^ = 0.7, L = 5 
and W ^ 3, and F ~ 0.5. Each base (nucleotide) is es- 
timated to have an effective charge of 0.094e from ref. lZI, 
leading to an effective charge of a bead being 0.282e. 
Thus, F = 0.5 corresponds to a voltage of about 187.9 
mV across the pore within the range of experimental pa- 
rameters [ll I2I, y, |5|, |6[ . The choice of W^ = 3 ensures that 
the average interaction of both bases A and C with the 
pore are attractive. The pore-base interactions SpA = 3.0 
and Epc = 1.0 are chosen based on comparison of the 
theoretical results [l9| with the experimental data \^ 
for the transloation time distribution histogram of the 
homoDNA molecules poly(dC)ioo and poly(dA)ioo- The 
Langevin equation is integrated in time by a method de- 
scribed by Ermak and Buckholz [2J| in 2D. Initially, the 
first monomer of the chain is placed in the entrance of 
the pore, while the remaining monomers are under ther- 
mal collisions described by the Langevin thermostat to 
obtain an equilibrium configuration. 

We consider the sequence dependent translocation re- 
sults for DNA of chain length N = 128 with the sym- 
metric blocks AnCn having block length M = 2n, with 
minimum value of n = 1 for poly(dAdC)64 and max- 
imum value oi n ~ N/2 for poly(dA64dC64). Fig. 2 
shows the translocation time r as a function of the block 
length. The translocation time is obtained as the time 
interval between the entrance of the first bead into the 



pore and the exit of the last bead [25[ . Typically, we aver- 
age our data over 2000 independent runs. The horizontal 
dashed, dotted and dash-dotted hues correspond to ta, 
(ta + Tc)/2 and tc, respectively. Here, ta and tc are 
the translocation times for poly(dA)i28 and poly(dC)i28, 
respectively. For M < 8, r is close to tc and much 
lower than {ta + tc)/2, with very weak dependence on 




FIG. 1: A schematic representation of the system. The pore 
length L = 5 and the pore width W = 3. 
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FIG. 2: Translocation time as a function of the block length 
for multi-block DNA with symmetric repeat units AnC„. 
Here the block length M = 2n, SpA = 3.0, Spc = 1.0, F = 0.5, 
and the chain length A'' = 128. The insert shows r as a func- 
tion of the block length for multi-block DNA with symmetric 
repeat units AnCn- Here, we define r as the ratio of translo- 
cation times for the base C entering the pore first and the 
base A entering the pore first. 



the block length and the orientation of which monomer 
enters the pore first. 

However, r begins to increase rapidly with M for 
M > 8, approaches a maximum between M ~ 16 and 
32, and finally decreases slowly with increasing M. In 
addition, r also depends strongly on the polymer orien- 
tation. It is always much longer for the base A entering 
the pore first than the other orientation. Quantitatively, 
we define r as the ratio of translocation times for the 
base C entering the pore first and the base A entering 
the pore first. The insert of Fig. 2 shows r as a function 
of the block length. For A/ < 4, r = 1, but for longer M, 
r increases exponentially with M. For poly(dAg4dC64), 

Qualitatively, the large block length results can be un- 
derstood by examining the extreme case of the largest 
block M = 128. When the Cq4 block is translocated first 



through the pore, subsequent forward motion is energeti- 
cally unfavored because of the strong attraction between 
the pore and the base A. As a result, frequent backward 
transitions occur which slows down the overall transloca- 
tion process. In the opposite orientation when Aq4 block 
goes through the pore first, the difference of probabilities 
between the forward and the backward steps of the re- 
maining motion is smaller, leading to a value r > 1. Sim- 
ilar behaviors have been recently analyzed by Kotsev and 
Kolomeisky [20| where the translocation of polymers con- 
sisting of a double-stranded block and a single-stranded 
block is considered, and by Tsuchiya and Matsuyama 21 1 
where they studied the translocation of an amphiphilic 
polymer. 

For the base C entering the pore first, r :$> {ta + tc)/2 
for M > 16. It is a surprise that t > ta for 16 < 
M < 64. For poly(dA), the frequency of backward and 
forward motion is much slower than that for poly(dC). 
Incorporating the base C with a suitable block length 
into poly(dA) will increase the frequency of backward 
and forward motion when the C block is in the pore. As 
a result, the translocation time is larger than ta- For the 
base A entering the pore first, t > tc and for 16 < M < 
32, t>{ta+ tc)/2. 

Meller et al. |j| have studied blockade signals for the 
heteroDNAs poly(dA5odC5o) and poly(dAdC)5o. The 
translocation events are organized into two well-localized 
groups with different blockage currents, the origin of 
which is yet uncertain [26|. Direct comparison with 
the present study is further complicated by the fact 
that poly(dA) molecules have a higher tendency to form 
single-stranded base-stacked helices as compared with 
poly(dC) [2, y, although the base-pore interaction ef- 
fect is still expected to be dominant. The group 2 data 
at low temperatures show that the translocation time for 
poly(dA5odC5o) is longer than that for poly(dAdC)5o, in 
agreement with our finding here that larger block length 
of repeat unit leads to a greater translocation time, as 
shown in Fig. 2. It would be desirable to have fu- 
ture experimental tests for the orientation dependence of 
the translocation for these heteropolymers as well. We 
have also studied the histograms of translocation time for 
the hetero-DNAs poly(dA„dC„). For short block length 
M = 2n, the histograms depend only weakly on the ori- 
entation, shown in Fig. 3(a), and the behavior is close 
to that of poly(dC). However, for longer block lengths, 
the histogram deviates markedly from a Gaussian with a 
long exponential tail as shown in Fig. 3(b). This behav- 
ior is in agreement with the experimental observation [J] . 
There is also a strong orientational dependence, with the 
histogram for C entering first shifted to longer translo- 
cation times. 

We have also investigated the distribution for waiting 
(residence) time of base s defined as the time between the 
events that the base s and the base s + 1 exit the pore. 
We find that the residence times for the ordered DNA 
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FIG. 3: Histogram of the translocation times for (a) 
poly(dAdC)64 and (b) poly(dA64dC64) under F = 0.5. 
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FIG. 4: Waiting times for (a) poly(dAi6dCi6)4 and (b) 
poly(dA64dC64) under F = 0.5. 



with repeat units A„C„ (for n > 2) exhibit "fringes" 
reminiscent of optical interference pattern, as shown in 
Fig. 4. The number of peaks is exactly equal to N/2n. 
The periodicity of the waiting time not only depends on 
the block length but also the orientational property of the 
chain as well. For the sequence j4„C„ with A entering 
the pore first [ea > ec)j the residence time is symmetric 
with respect to the center of the chain containing exactly 



N /An maxima on either side, whereas with C entering the 
pore first has {N/2n — l) maxima with the last maximum 
ending with the largest waiting time. 

The sequence dependence of the waiting time distri- 
bution yields a better understanding for the sequence 
dependence of r shown in Fig. 2. The translocation 
time can be written as r ~ ti + T2 + T3, where ti, T2 
and Ta correspond to initial filling of the pore, transfer of 
the base from the cis side to the trans side, and finally 
the emptying of the pore, respectively. For the present 
case, Ti << T2, T3. For M < 8, T2 dominates and it 
has no strong dependence on the detailed sequence or 
the orientation of the chain. When the base C enters 
the pore first, T3 increases rapidly with increasing M for 
8 < M < 16, and then saturates to a constant value for 
32 < A/ < 128. On the other hand, T2 is related to both 
the number of the "fringes" and the corresponding max- 
imum time. With increasing M, the former decreases 
and the latter increases. The interplay of all these fac- 
tors leads to a maximum for r as a function of the block 
length M. Similar consideration applies for the base A 
entering the pore first except that here T2 dominates over 

To summarize, we have demonstrated that sequences of 
a driven DNA can be identified from its translocation spe- 
cific characteristics driven through a nanoporc that has 
different affinity for each base. Simulation studies based 
on this attractive nanopore model are in accord with the 
existing experimental data. A stronger attraction for the 
polynucleotide A inside the nanopore leads to a much 
longer translocation time for (polydA)ioo as compared to 
(polydC)ioo- Further analysis explains the shape of the 
histogram of the first passage time, provides an under- 
standing of how translocation time depends on a specific 
sequence, and explains the experimental data of longer 
translocation time for (polydA)5o(polydC)5o compared 
to (polydAdC)5o. Our simulation studies also reveal a 
novel phenomenon that the information for the periodic- 
ity of the block sequences is contained in the periodicity 
of the residence time of the individual nucleotides. 
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